On the complexity of inducing categorical and quantitative association rules
نویسندگان
چکیده
Inducing association rules is one of the central tasks in data mining applications. Quantitative association rules induced from databases describe rich and hidden relationships to be found within data that can prove useful for various application purposes (e.g., market basket analysis, customer profiling, and others). Although association rules are quite widely used in practice, a thorough analysis of the related computational complexity is missing. This paper intends to provide a contribution in this setting. To this end, we first formally define quantitative association rule mining problems, which include boolean association rules as a special case; we then analyze computational complexity of such problems. The general problem as well as some interesting special cases are considered.
منابع مشابه
Finding Association Rules From Quantitative Data Using Data Booleanization - Imberman and Domanski
Finding association rules in data that is naturally binary has been well researched and documented. Finding association rules in numeric/categorical data has not been as easy. Many quantitative algorithms work directly on the numeric data limiting the complexity of the generated rules. In addition, as you create intervals from the numeric data the dimensionality of the problem increases signifi...
متن کاملOn the Complexity of Mining Association Rules
In this paper we describe our ongoing research towards establishing the complexity of mining association rules from relational databases. We consider both quantitative, categorical and boolean association rules and various forms of quality indexes, including confidence, support, gain, laplace. The presented results show that all these problems are, generally, computationally hard to solve, even...
متن کاملA new approach based on data envelopment analysis with double frontiers for ranking the discovered rules from data mining
Data envelopment analysis (DEA) is a relatively new data oriented approach to evaluate performance of a set of peer entities called decision-making units (DMUs) that convert multiple inputs into multiple outputs. Within a relative limited period, DEA has been converted into a strong quantitative and analytical tool to measure and evaluate performance. In an article written by Toloo et al. (2009...
متن کاملMining association rules from qualitative and quantitative clustering
A comparison of mining association rules from clusters generated by qualitative clustering and clusters obtained by quantitative clustering is presented. Whereas in quantitative clustering only numerical data are included, numerical and categorical data are used in qualitative clustering for record conglomeration. The aim of this paper is to compare the performance of the two different kinds of...
متن کاملOptimizing Membership Functions using Learning Automata for Fuzzy Association Rule Mining
The Transactions in web data often consist of quantitative data, suggesting that fuzzy set theory can be used to represent such data. The time spent by users on each web page is one type of web data, was regarded as a trapezoidal membership function (TMF) and can be used to evaluate user browsing behavior. The quality of mining fuzzy association rules depends on membership functions and since t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Theor. Comput. Sci.
دوره 314 شماره
صفحات -
تاریخ انتشار 2004